® 



J 



Europ&lsches Patentamt 
European Patent Office 
Office europ6en des br vets 



© Publication number: 



0 285 123 

A2 



EUROPEAN PATENT APPLICATION 



© Application number: 88106163.5 
© Date of filing: 30.03.88 



© int. CI.*-. C12N 15/00 , C12P 21/00 



A request for correction of the originally filed 


© Applicant: SUOMEN SOKERI OY 


description pages 9 (line 12 and 15), 17 (line 24), 


Kyllikinporttl 2 


21 (column T) and claim 1 (line 8) has been filed 


SF-00240 HelslnW(Fl) 


pursuant to Rule 88 EPC. A decision on the 




request will be taken during the proceedings 


© Inventor: Lehtovaara, P&lvl 


before the Examining Division (Guidelines for 


Tehtaank. 4A2 


Examination in the EPO, A-V, 2.2). 


SF-00140 Helslnki(Fl) 




Inventor: Knowles, Jonathan 




Rltokalllontie 16A 


© Priority: 0&04.87 US 34819 


SF-00330 Helslnkl(FI) 


© Date of publication of application: 


Inventor: Kolvula, Anu 


KoilllsvMyli&2A24 


05.10.88 Bulletin 88/40 


SF-00200 HelslnW(FI) 




Inventor: Bamford, Jaana 


© Designated Contracting States: 


Kartanont 14A15 


AT BE CH OE ES FR GB GR IT LI LU NL SE 


SF-0044O J£rvenpiUi(FI) 




Inventor: Refnikalnen, Tapani 




Klaarant7A10 




SF-00200 Helslnki(FI) 




© Representative: Masch, Karl Gerhard et al 




Patentanwaite AndreJewsW, Honke & Partner 




Theaterplatz 3 Postfach 10 02 54 




D-4300 Essen 1(DE) 



CM 

m 
oo 

CM 



® A method for complete mutagenesis of nucleic acids. 

© The invention relates to methods for Introducing alt types of base substitution mutations randomly to any 
nucleic acid. The methods comprise 

(a) preparation of the target nucleic acid to be mutagenized in the form of a single stranded template; 

(b) extension of an oligonucleotide hybridized to this template to generate a population of molecules, 
| sach copied from the template, and comprising molecules terminating at all possible nucleotide positions 
■ within a predetermined region; 

(c) misincorporation of nucleotides opposite to each position of the template using polymerase and 
the variable 3' ends generated in (b) as primers; 

(d) completion of the molecules to forms that can be amplified by molecular cloning in a suitable 
host; and 

(e) expression of the mutagenized DNA using a suitable host vector system allowing the screening 
for an altered gene or gene product. ~ 
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A method for complete mutagen sis of nucleic acids 

This invention relates to methods for introducing in vitro all types of base substitution mutations 
randomly to any nucleic acid. Using this method it is possible to develop novel polypeptides or improved 
genetic expression systems for biotechnical applications. The method can also be used for identification of 
functionally important regions in proteins and gene regulatory regions. In one specific aspect, this invention 

5 relates to the generation of a population of DNA molecules, copied from a template, which comprise 
molecules terminating at all possible nucleotide positions within a predetermined target region, misincor- 
porating nucleotides opposite to each position on the template and completing said molecules to forms 
which can be amplified by molecular cloning, expressing the mutagenized DNA, and screening for an 
altered gene or gene product 

io Proteins and peptides play a central role in all biotechnical and pharmaceutical applications. Naturally, 
the properties of a particular protein should be chosen such that it functions under the conditions of the 
specific application as well as possible. In practice this is often achieved by screening a number of 
organisms for one producing a protein with the most appropriate properties. However, enzymes and other 
polypeptides have been selected during natural evolution for a specific molecular structure which allows 

is them to function optimally in their respective host cells, and to interact with many other components of the 
cell. Therefore a protein isolated from a living organism is very unlikely to be perfectly suited to use in a 
given application. 

The specific requirements of various technological applications thus create needs for altering some of 
the natural properties of the polypeptides used. Optimal performance of an enzyme in a biotechnical 
20 process may require alterations in properties such as pH-optimum, pH stability, thermostability, Stability 
against oxidation or denaturation, substrate specificity, turnover number, spectrum of products, inhibition 
properties etc. 

The primary structure of a polypeptide, as coded for by its respective gene, contains the determinants 
for its folding to a specific three-dimensional structure, and also for all of its properties. Therefore, by 

25 changing the base sequence of the respective gene, all properties of a protein can be in prfnciple modified 
towards a desired optimum. Gradual accumulation of random base substitution mutations is an essential 
mechanism creating genetic diversity, which serves as material for selection under various conditions during 
the natural evolution of proteins. Any method able to mimic this process would provide a good basis for 
development of novel proteins for tehnological applications. The essential features of such methods include 

30 the random and efficient generation of base substitutions and the optimization of the ratio between the 
twelve different types of base substitutions. 

A specific replacement of only one functionally critical amino acid by site specific oligonucleotide 
mutagenesis (see, Carter, P., Bedouelle H. and Winter, G., Improved oligonucleotide site-directed 
mutagenesis using M13 vectors 13 Nucleic Acids Res. 4431-4443 (1985); Smith M., In vitro mutagenesis, 

35 19 Ann. Rev. Genet 423-462 (1985); Leatherbarrow R.J. and Fersht, A.R., Protein engineering, 1 Protein 
Engineering 7-16 (1986)) has been demonstrated to be sufficient to bring about the desired modification of 
protein properties in, for example, the following cases: an altered specificity of trypsin, an altered pH 
dependence or enhanced resistance to chemical oxidation of subtilisin and an increased thermostability of 
lysozyme or dlhydrofolate reductase (Leatherbarrow and Fersht 1986). However, the above enzymes all 

40 belong to the exceptionally rare examples where extensive information is available concerning the en- 
zymoiogy and three dimensional structure. Even in such cases, the choice of a suitable amino acid 
replacement still often involves much trial and error. This is due to the fact that precise prediction of which 
mutation is required to bring about a desired change in the properties of a protein is usually impossible, 
since our knowledge concerning the natural laws governing the structure and function of proteins is still very ^ 

45 perfunctory. 

Application of heteroduplex or cassette type of oligonucleotide mutagenesis to introduce several 
different mutations to complete genes consisting of hundreds or thousands of nucleotides is extremely * 
difficult and expensive if not impossible in practice (Hutchison, OA, Nordeen S.K., Vogt, K. and Edged, 
M.H., A complete library of point substitution mutations in the glucocorticoid response element of mouse 
so mammary tumor virus . 83 Proc. Nati. Acad. Sci. USA, 710-714 (1986)). 

The pri r art describes two different methods which enable making mutations to long DNA targets: 
chemical mutagenesis (Shortle D„ and Nathans, D. Local mutagenesis: a method for generating viral 
mutants with base substitutions in preselect d regions of the viral genome, 75 Proc. Natl. Acad. Sci. USA, 
2170-2174 (1978); Hirose S. ( Takeuchi, K. and Suzuki, Y. In vitro characterization of the fibroin gene 
promoter by the use of single-base substitution mutants, 79 Proc. Nati. Acad. Sci. USA 7258-7262 (1982)) 
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and enzymatic misincorporation of a noncomplementary nucleotide or thlonucleotide (Zakour R.A. and Loeb 
LA. Site-specific mutagenesis by rror-directed DNA synthesis, 295 Nature 708-710 (1982); Shortle D., 
Grisafi, P., Benkovic, S.J. and Botstein, D., Gap misr pair mutagenesis: efficient site-directed induction of 
transition, transversion and frameshift mutations in vitro , 79 Proc. Nati. Acad. Sci. USA, 1588-1592 (1982)). 

5 Using the chemical mutagenesis method it Is only possible to obtain a few types of base substitutions with 
each type of chemical modification, and the distribution of the mutations tends to be clustered in "hot 
spots". The chemical approach has been recently modified to give a wider range of mutations by making 
use of several different chemicals and by separate mutagenesis of both strands of DNA (Myers R.M., 
Lerman, L.S. and Maniatis, T., A general method for saturation mutagenesis of cloned DNA fragments, 229 

ro Science 242-247 (1985). This method, while giving all possible point mutations, is laborious, time- 
consuming and involves use of mutagenic chemicals which are hazardous to health, in addition, it is not 
possible to optimize precisely the relative ratio, distribution and efficiency of all the different classes of 
mutants. 

Using the gap misrepair method as described in Botstein, D. and Shortle, D., Strategies and applica- 

75 tions of in vitro mutagenesis, 229 Science 1191-1201 (1985); Shortle, D., and Lin, B., Genetic analysis of 
Staphylococcus nuclease: identification of three intragenic "global" suppressors of nuclease-minus muta- 
tions, 110 Genetics, 539-555 (1985), a single nick is introduced into a covalently closed circular plasrnid 
with DNasel. The nicks are enlarged to short gaps where the 3'OH serves as the primer for misincorporation 
by polymerase. The disadvantage of the gap misrepair method Is that the vector is also mutagenized, which 

20 decreases the efficiency of the method for the target region, and requires a time-consuming recloning step 
for the target DNA. Moreover, since both strands of the plasmld are mutagenized simultaneously, strand 
selection methods cannot be used to increase the efficiency of mutagenesis. It is not possible to localize 
mutations only to the region of DNA under study, such as the coding region or the promoter. 

These facts reduce the -efficiency of the gap misrepair method for generating the mutant library. 

25 Assuming a vector size of 3 kb, an insert size of 1 kb and a 100% efficient method, the maximal theoretical 
yield of mutants is only 12.5%. When a 1 00O-nucleotide region Is mutagenized, the number of different 
possible point substitution mutants is 3000. Using the gap misrepair method, a complete library would be 
comprised of 24,000 clones (assuming 100% efficiency). Because the method for identifying mutant 
proteins can be laborious, it is of crucial importance that the mutagenesis method is highly efficient so that 

30 the library containing ail mutants is as smalt as possible. 

We have developed a complete mutagenesis method which makes it possible to saturate any cloned 
gene with random mutations. 

The complete mutagenesis method described in this application allows rapid and efficient mutagenesis 
of defined regions of DNA. Since the mutagenized DNA strand can effTdentiy be selected for by known 

35 methods, the complete library made with this method would comprise only 3000 clones (assuming 100% 
efficiency). The number of transformants which must be screened in order to find a specific mutant naturally 
increases considerably if the mutations are not generated randomly. These parameters can be optimized to 
acceptable levels in the method described here. The present invention has clear advantage over the other 
methods, because it enables rapid and efficient mutagenesis of the desired target only, leaving other vector 

40 elements unmutagenized. 

The present invention contemplates a mutagenesis method which allows rapid and efficient 
mutagenesis of defined regions of DNA. In addition, this method results in the rapid and efficient 
mutagenesis of the desired target only, leaving other vector elements unmutagenized. 

Specifically, the method contemplates the introduction of random point mutations to a nucleic acid by 

46 preparing a target nucleic acid in the form of a single stranded template, extending an oligonucleotide 
hybridized to the template to generate a population of DNA molecules, which comprise molecules 
terminating at all possible nucleotide positions within a predetermined region, and misincorporating 
nucleotides opposite to each position on the template. Tne DNA molecules so generated are amplified by 
molecular cloning, and then screened for an altered gene or gene product. 

so The invention also contemplates a method of mutagenesis for preparing novel polypeptides, including 
those not found In nature, and a mutagenesis method for altering the structure and function of nucleic acid 
regions which function in regulating gene expression. 

Figure 1 is a schematic representation of the method for generating a library of mutant genes. 
Mutagenesis of template Ts is shown. A similar strategy is used to mutagenize template A's, C's or G's in 

55 separate series of reactions. The template T positions to be mutagenized are determined by the limit d 
elongation step. The oligonucleotide primer is elongated with polymerase in conditions where the con- 
centration of dATP soon becomes limiting, so that the elongation will stop at different points of sequence 
where an A would be required. After removal of free nucleotides, misincorporation to the 3' ends is carried 
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out using reverse transcriptase and only dCTP,- dGTP and TTP. The molecules are finally elongated to 
completely double stranded forms by using polymerase and alt trie four nucleotides, and the gaps are 
sealed with ligase. 

Figure 1b shows a typical result from the analysis of base-specifically ending elongated primers. The 

5 products from the limited elongation step have been etectrophoresed in 6% sequencing gels. The sizes of 
the fragments are given as the number of nucleotides from the 3* end of the primer. Panel A: The single 
stranded uracil containing M13mp19 template has been elongated with Klenow polymerase using the primer 
GAAGGGCGATCGGTGCG. The limiting nucleotide a -^P-dCTP (diluted with the nonradioactive nucleotide 
to about 30 Cl/mmoi) was used at three different concentrations, 0.075 uM, 0.20 uM and 0.325 uM (lanes 

10 1-3, respectively). Aliquots of the samples were analyzed on sequencing gel. In lanes 4^6 the limiting 
nucleotide was a -^P-dGTP (about 30 Cl/mmol). The marker lanes C and G show the corresponding 
dideoxy sequencing reactions, respectively. ^S-label was used in the marker lanes. Panel B: Template 
DNA, here containing an insert coding for a-amylase from Bacillus stearothermophilus . has been copied 
from two different 20-mer oligonucleotide primers. The limiting nucleotide, a -^S-dATP (diluted to about 300 

is Ci/mmol), was here used at four different concentrations, 0.03 uM, 0.075 uM, 0.20 uM and 0.32 uM 
(increasing concentrations in lanes 1-4 from one primer, and in lanes 5-8 from another primer, respectively). 

Figure 2 shows the region coding for the crfragment of E.coli 0-galactosidase, present in the DNA of 
the phage M13mp19. The sequence to which the specific oligonucleotide primer GAAGGGCGATCGGTGCG 
is hybridized is underlined. The region within parentheses shows the multiple cloning site. 

20 The wild type DNA sequence shown on top of the amino acid sequence has been mutagenized by 
introducing mostly single base substitutions. At least 176 different base substitutions obtained are shown. X 
is a mutation demonstrated by single track sequencing only. 

Figure 3 shows the recombinant phages M13mp18a and M13mp19 a containing the B. stearother- 
mophilus a-amylase gene. 

25 Figure 4 shows the amino acid sequence of Bacillus stearothermophilus a-amylase, using one-letter 

notations for the amino acids. Some of the amino acid changes found by sequencing of the mutants are 
shown on top of the wild type sequence. 

Figure 5 shows the altered pH-dependence of the a-amylase activity of the mutant Tyr 265 -* Ser, as 
compared to the wild type enzyme. 

30 The present invention is a complete mutagenesis method which makes it possible to saturate any 
cloned gene with random mutations. This on the average leads to one amino acid change per mutant of the 
respective protein. Novel mutant enzymes having properties closer to that of the Ideal are found by 
screening clones expressing the mutagenized gene. To permit even elaborate screening procedures, ft is 
important that the wild type background is as low as possible, that the mutagenesis method Is able to target 

3$ the mutations in such a manner that they are distributed evenly within the region of Interest and not 
elsewhere, and that the balance between all the possible 12 base substitutions can be optimized. 

Table 1 shows which amino acid changes can be made to proteins by introducing all possible 574 
single base substitutions to the 64 different codons. Silent mutations with no effect at protein level are 
shown in parentheses. Table 2 summarizes the data of amino acid replacement in Table 1. By using a 

40 suitable screening technique the complete mutagenesis method would have generated and identified the 
following single base mutations: the trypsin with altered substrate specificity (Gly226 — Ala, Craik, C.S., 
Largman, C, Fletcher, T„ Rocznlak, S., Barr, P.J.. Fletterick, R. and Rutter, W.J., Redesigning trypsin: 
alteration of substrate specificity, 228 Science, 291-297 (1985)), the subtilisin mutant resistant to chemical 
oxidation (Met 222 -+ Leu, Estel, DA, Graycar, T.P. and Wells, JA, Engineering an enzyme by site- 

45 directed mutagenesis to be resistant to chemical oxidation, 260 J. Biol. Chem., 6518-6521 (1985)), the 
thermostable tysozyme mutant Lys 54 — Thr, Perry, LJ. and Wetzel, R., Unpaired Cysteine-54 interferes 
with the ability of an engineered disulfide to stabilize T4 (ysozyme, 25 Biochemistry, 773-739 (1986)) and 
the tRNA synthetase mutant with a hundredfold improved affinity for the substrate ATP (Thr51 -* Pro, * 
Wilkinson, A.J., Fersht, A.R., Blow, D.M., Carter, P. and Winter, G. t A large increase in enzyme-substrate 

so affinity by protein engineering 307 Nature, 187-188 (1984)). % 

Table 1 

55 Amino acid replacements obtainedA la 
GCU - (Ala), Val Asp Ser Pro Thr Gly 
GCC — (Ala), Val Asp Ser Pro Thr Gly 
GCA (Ala), Val Glu Ser Pro Thr Gly 



4 



I 



0 285 123 





GCG 


- (Ala), 


Val Glu Ser Pro Thr Gly 




Arg 








CGU 


- (Arg), 


Leu Pr His Cys Ser Gly 


5 


CGC 


- (Arg), 


Leu Pro His Cys Ser Gly Arg 




CGA 


— (Arg)* 


Leu Pro Gin term Glv 

* ■ * • ■ I'll V ▼ 




CGG 


- (Arg). 


Leu Pro Gin Trp Gly 




AGA 


-(Arg), 


term Gly Ser, He Thr Lys 




AGG 


- (Arg), 


Trp Gly Met Thr Lys Ser, 


10 


Asn 








AAU 


-(Asn) 


Asp Tyr His He Thr Ser Lys, 




AAC 


-(Asn) 


Asp Tyr His He Thr Ser Lys, 




Aso 






15 


QAU 


-(Asp) 


Va! Ala Gly Glu, Tyr His Asn 




GAC 


-(Asp) 


Val Ala Gly Glu Tyr His Asn Glu 




Cvs 








UGU 


- (Cys) 


Arg Ser Gly term Trp Phe Ser Tyr 


20 


UGC 


- (Cys) 


Arg Ser Gly Phe Ser Tyr term Trp 




Gin 








GAA 


- (Gin) 


Arg Leu Pro His, term Lys Glu 


25 


CAG 


- (Gin) 


Arg Leu Pro His, term Lys Glu 




Glu 








GAA 


- (Glu) 


Val Ala Gly Asp, term Gin Lys 






\vaiuj 


\/al Ala /^K/ A cr*i Iamw f3ln 1 wo 

vai /via vaiy Asp, ierm uin Lys 


30 


Gly 








GGU 


-(Giy). 


Val Ala Asp Cys Arg Ser 








vaj Mia Msp oys Mrg ©er wiy 




GGA 


- (Gly), 


Val Ala Glu term Arg, 


35 


GGG 


- (Gly). 


Val Ala Glu Trp Arg, 




His 








CAM 




Mrg Leu rro vain, rvsn i yr Asp 




CAC 


- (His) 


Arg Leu Pro Gin Asn Tyr Asp Gin 


40 


lie 








AUU 


- (He), 


Val Phe Leu Thr Asn Ser Met 




AUC 


- (»e), 


Val Phe Leu Thr Asn Ser Met 




AUA 


- (He), 


Val Leu, Thr Lys Arg Met 


46 


Leu 








UUA 


- (Leu), 


He Vai Ser term, Phe, 




UUG 


-(Leu), 


Met Val Ser term Trp Phe, 




CUU 


- (Leu), 


Pro His Arg He Phe Va! 




cue 


- (Leu), 


Pro His Arg He Phe Val Leu 


50 


CUA 


-(Leu), 


Pro Gin Arg He Val Leu 




CUG 


- (Leu). 


Pro Gin Arg Met Val 



Lys 

AAA -* (Lys) Glu term Gin He Thr Arg Asn, 
55 AAG — (Lys) Glu term Gin Met Thr Arg Asn, 
Met 

AUG -> (Val Leu, Thr Lys Arg He, 
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Phe 










UUU 




(Phe) 


Leu, lie Val Ser Thr Cys 




UUC 




(Phe) 


Leu, lie Val Ser Thr Cys 


5 


Pro 










ecu 




(Pro), 


Leu His Ser Thr Arg Ala 




CCC 


~* 


(Pro), 


Leu His Ser Thr Arg Ala Pro 




CCA 




(Pro), 


Leu Gin Ser Thr Arg Ala 




CCG 


™* 


(Pro), 


Leu Gin Ser Thr Arg Ala 


70 


Ser 










UCU 




(Ser), 


Pro Thr Ala Phe Tyr Cys 




UCC 


-* 


(Ser), 


Pro Thr Ala Phe Tyr Cys Ser 




UCA 


•* 


(Ser), 


Pro Thr Ala term, Leu 


75 


UCQ 




(Ser), 


Pro Thr Ala term Leu Trp 




AGU 




(Ser) 


Gly Cys Arg, He Thr Asn 




AGC 




(Ser) 


Gly Cys Arg, lie Thr Asn Arg 




Thr 








20 


ACU 




(Thr), 


Ala Ser Pro lie Asn Ser 




ACC 




(Thr), 


Ala Ser Pro He Asn Ser Thr 




ACA 




(Thr), 


Ala Ser Pro He Lys Arg 




ACQ 




(Thr), 


Ala Ser Pro Met Lys Arg 


25 


Trp 










UGC 




(Arg, Gly Leu Ser term, Cys, 




Tyr 










UAU 




(Tyr) 


His Asn Asp Cys Phe Ser term, 


30 


UAC 


— * 


(Tyr) 


His Asn Asp Cys Phe Ser term, 




Val 










GUU 




(Val), 


Ala Asp Gly He Leu Phe 




GUC 




(VaO, 


Ala Asp Gly He Leu Phe Val 


35 


GUA 




(Val), 


Ala Glu Gly Leu, lie 




GUG 




(Val), 


Ala Glu Gly Met, Leu 




term 










UAA 




(term), 


Gin Lys Glu Leu Ser Tyr, 


40 


UAG 




(term) 


Gin Lys Glu Leu Ser Tyr, Trp 




UGA 




(term) 


Arg, Gly Cys, Trp Leu Ser 



46 



SO 
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2 
6 
2 
2 
2 
2 
4 
2 
3 
€ 
2 
1 
2 
4 
6 
4 
1 
2 
4 
3 



To i 
™> 

AAAACGGGHI 
From:l srsylllil 



n g p 



n u 



L L 
e y 
u s 



M 
e 
t 



P P 

h r 



S 
e 
r 



T T T V 
h r y a 



** 

36 Ala 
18 Asn 
54 Arg 
18 Agp 
18 Cys 
18 Gin 
18 Glu 
36 Gly 
18 His 
27 He 
54 Leu 
18 Lys 

9 Met 
18 Phe 
36 Pro 
54 Ser 
36 Thr 

9 Trp 
18 Tyr 
36 Val 
27 term 



64 576 



t 
e 

r 

m 



12 

2 

18 


2 
2 

2 2 


2 4 

2 

6 2 


2 4 
14 2 


4 

1 4 


4 4 
2 2 
6 2 2 


4 

2 

2 


2 2 

2 
2 


2 

2 

2 


4 2 2 
2 

2 4 


2 2 


2 

2 


4 2 


2 2 
2 2 
2 


2 

4 6 
2 2 


4 2 
2 2 
2 4 


2 2 
2 12 

2 


2 

2 


2 


2 1 


2 2 
4 1 

2 


2 1 
4 

4 2 


2 
2 


2 

2 


6 4 1 
4 18 
1 2 


3 2 
2 6 4 
1 


2 3 
2 1 
2 


3 

6 3 
2 


1 

4 4 


2 

2 


2 


3 2 1 
2 6 
4 


2 

12 


1 

2 

4 4 


1 

2 2 


4 2 6 
4 2 2 
2 


4 
2 


2 
1 


2 2 

3 2 
1 


2 4 
1 4 


14 6 1 
6 12 
1 


2 3 

2 


2 

4 

2 


2 2 
2 

2 2 


2 

2 4 
2 1 


3 5 
3 2 


2 

2 2 


2 

3 2 


2 4 

12 
4 4 
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* number of alternative codons 

** number of different point mutations 

The method comprises the following procedures. Firstly, a genetic library of mutant genes is generated 
in vitro (as shown by Fig. t). Each mutant contains one or two point mutations and the complete library 
contains all of the possible single base mutations of the gene. The mutant library is then expressed In a 
suitable host vector system and mutant genes coding for example for enzymes with improved properties 
are identified on the basis of en2yme activity measurements. These mutants with improved properties can 
then be then remutagenized to isolate even better mutants. This process mimics the evolution of proteins in 
nature, but allows man to determine the precise basis for selection and can be accomplished very rapidly. 

The first step, limited elongation of a primer, which is carried out in four separate reactions, gen rates a 
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population of molecules each copied from the template and comprising molecules terminating at all 
possible nucleotide positions within a predetermined region. The second step, enzymatic m incorporation, 
generates point mutations into this molecular population. It is possible to make all the twelv possible point 
mutations or only a desired subset of them. Any base will be mutag nized t any of the three oth r bases. 

5 and the frequency of the different base substitutions can b optimized. The position and length of the target 
sequence to be mutagenized can be varied. The third step involves completion of the mutagenized 
molecules to forms that can be then amplified and isolated by molecular cloning. 

Mutant proteins with the desired properties are then identifred on the basis of enzyme activity produced 
by the expression of the mutagenized genes in a suitable host. Specific conditions are designed to screen 

10 the transformants for the particular trait in question. For example, if a mutant enzyme with an increased 
thermo stability is screened for, the clones are first made to express the mutant enzymes under optimal 
expression conditions, and then aiiquots of the mutant enzymes produced are subjected to the testing 
temperatures and finally to activity determinations. When a desired clone has been found, the precise 
mutation can be identified by sequencing the mutant nucleic acid \f such Identification is required. 

15 

Example 1: Total Mutagenesis of the q-Fragment of fl-galactosidase from E-coli 

The a-fragment of £-galactosidase is mutagenized to demonstrate how the method works to generate a 

20 library of mutant genes. The protein product can be directly screened for light blue and white mutant 
phenotypes. The region coding for the a-fragment of E. coll 0-gaIactosidase is set forth in Figure 2 and is 
Included in the DNA of the single stranded phage vector M13mp19 as described in Yanisch-Perron, C, 
Vleira, J. and Messing, J., Improved M13 phage cloning vectors and host strains: nucleotide sequences of 
the M13mp18 and pUC19 vectors, 33 Gene, 103-119 (1985), Messing, J. New M13 vectors for cloning, 101 

25 Methods Enzymol., 20-78 (1 983). 

A uracil-containing, single stranded template of M13mp19 phage is prepared essentially as described in 
Kunkel, TA, Rapid and efficient site-specific mutagenesis without phenotypic selection, 82 Proc. Natl. 
Acad. Sci. USA 488-492 (1985). The ung-dut- strain of E. coli, BW313, is grown overnight at 37°C in 20 ml 
of 2 x TY medium. Then 5 ml of the cells are diluted to 200 ml with 2 x TY and grown for 1/2 h. The cells 

30 are infected with M13mp19 phage at the multiplicity of infection of 5, and grown in 200 ml of 2 x TY 
supplemented with 0.25 ug/ml uridine for 4-6 hours at 37°C. The supernatant is used to infect cells again as 
previously described to obtain a second cycle of phage growth. This uracil-containing M13mp19 phage 
grows in the dut-ung- mutant strain BW 313 (typically to a titer of 4 x 10 t0 -1 x 10" plaque forming units/ml) 
but does not grow in the dut+ unji+ strain (typical titer is 5 X 10* -2 X 10*, representing phages lacking uracil 

35 in the genome). 

Single stranded uracil-containing M13mp19 DNA is isolated as follows. The cells grown In 200 ml as 
described are pelleted for 10-20 min by centrrfugation at 5000 rpm In a Sorvall GSA rotor, and the 
supernatant is made 0.5M in NaCI and 8% in polyethylene glycol ("PEG"). After 1 hour at 0 W C It is 
centrrfuged at +4°C for 15 min at 10,000 rpm (GSA), and the phage pellet is dissolved in 30 ml TE. PEG is 
40 added to 2% and NaCI to 0.5 M, and after 1 hour at 0°C the phage is pelleted again by centrifugation for 
10 minutes at 15,000 rpm, +4°C in a Sorvafl rotor SS-34. The pellet is resuspended in 3 ml of 10 mM Trls- 
HCl pH 8.0, 50 mM NaCI, 1 mM EDTA and extracted twice with an equal volume of phenol, then with 
chloroform: isoamylaicohol (24:1) and finally three times with ether. The ONA Is precipitated with ethanol 
and redissolved in TE. 

45 The 17-mer oligonucleotide primer (2 ug) GAAGGGCGATCGGTGCG is 5' end labeled to a low specific 
activity with polynucleotide kinase using -r 32 p-ATP and ATP as described In Maniatis, T., Frrtsch, ER and 
Sambrook, J M Molecular Cloning, Cold Spring Harbor Laboratory (1982), and the primer is purified in a Bio- 
Get P-6 gel filtration column (1 x 20 cm) in TE. The peak fractions of the eluate are combined, made 2 M in * 
ammonium acetate and the DNA Is precipitated with ethanoL 

so The uracil-containing template M13mp19-U (8.5 pmol) and the 17-mer primer (17 pmol) in 30 - 60 ul 15 
mM Tris-HCI pH 8.0, 7.5 mM MgCI, Is heated for 15 min at 80 e C and slowly cooled to about 30°C. The * 
annealed template/primer mixture is diluted with 10 mM Tris-HCI pH 8.0, 5 mM MgCI* to a concentration of 
0.1 pmol template/primer per ul. 

55 
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Limited Elongation 

315 ul of water and 60 ul 100 mM Tris-HCI, pH 8.0, 50 mM MgCI, are added to 60 ul of 
template/prim r (6 pmol), and the sampl is distributed into 12 tubes in 36 ul (0.5 pmol) aliquots. To these 
5 tubes the components are added as follows (either thionucleotide or ordinary nucleotide triphosphates can 
be used): 



Tube number 

10 1 2 3 4 5 6 

0.33 mM CGT mix 8 jul 8 jul 8 /ul 

0.33 mM AGT mix 8 jul 8 jul 8 jul 

rs Limiting Nucleotide : 

5 jul 



20 



25 



46 



60 
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0 


.75 


juM 


dATP 


2 


.0 


iuM 


dATP 


3 


.25 


AIM 


dATP 


O 


.75 


JUM 


dCTP 


2 


.0 


juM 


dCTP 


3 


.25 


flM 


dCTP 



5 jul 



5 fil 



5 pi 



5 ul 



Limiting Nucleotide : 

5 fil 





0.75 


juM 


dGTP 


35 


2.0 


juM 


dGTP 




3.25 


AiM 


dGTP 




0.75 


MM 


TTP 




2.0 


juM 


TTP 


40 


3.25 


jiM 


TTP 



5 jul 

5 pi 

5 jul 



5 pi 



5 jul 



Tube number 
7 8 9 10 11 12 



0.33 mM ACT mix 8 jul 8 jul 8 jul 

30 0.33 mM ACG mix 8 jul 8 jul 8 jul 



5 jul 



where the 0.33 mM CGT mix contains dCTP, dGTP and TTP each at 0.33 mM concentration; 0.33 mM AGT 
mix dATP, dGTP and TTP each at 0.33 mM; 0.33 mM ACT mix dATP, dCTP and TTP each at 0.33 mM; 
and 0.33 mM ACG mix dATP, dCTP and dGTP each at 0.33 mM concentration. 

Finally 1 ul of Klenow polymerase (5 U) is added to each of the tubes 1 - 12, and the samples are 
incubated for 5 min at room temperature. To stop the reactions 3 ul of 0.3 M EDTA is added to each of the 
tubes 1-12. 

The limited elongation can be monitored by electrophoresis of aliquots of the samples on a sequencing 
gel (Sanger et al. f 1977). The labeling of the samples for this purpose is done either by end-labeling the 
primer with T-^p-dATP as described above or by incorporated some a-^p-dNTP (any nucleotide) during 
the limited elongation. A typical result from a limited elongation is shown in Fig. 1b. The extent of limited 
elongation is regulated by the concentration of the limiting nucleotide. Thus the resulting populations of 
extended molecules can be adjusted to have the 3' ends at variable distances from the primer. Under the 
conditions described here, the combined molecular populations cover at least 200 bp from the prim r. 

After the limited elongation step the contents of tubes 1 - 3 ar combined and labeled as A- ( likewise 
those of tubes of 4 - 6 (C-), 7 - 9 (G-) and 10 - 12 (T). 12 ug carrier glycogen is added to each sample, and 
the samples are extracted twice with phenol, twice with ether, made 250 ul in 2M ammonium acetate and 
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precipitated with 650 ul of 94% ethanol. The pellets are dissolved in 200 ul TE. 50 ul 10M ammonium 
acetate is added and the precipitation st p r p at d. After thre successlv precipitations to efficiently 
remove free nucleotides, the samples are washed with 70% ethanol, dried briefly and dissolved in 10 mM 
Tris-HCI pH 8.3. 0.1 mM EDTA. 



Misincorporatlon 

The population of molecules in tube A-consists of primers elongated to a variable degree and 

10 hybridized to the template so that each 3' end is next to a position where the template contains a T. See 
Figure 1. When these elongated primers are now further elongated with a reverse transcriptase in the 
presence of dCTP, dGTP and TTP but in the absence of the correct nucleotide dATP, wrong nucleotides 
will be Incorporated instead of the correct one. Since the correct nucleotide Is always known, the efficiency 
of the method can be increased by omitting that nucleotide from the misincorporatlon reaction. Klenow 

75 polymerase and the corresponding trinucleotides can also be used. The misincorporatlon step using three 
wrong nucleotides is carried out for each of the four samples A-, C-. G-and T-. The concentrations of the 
"wrong" nucleotides are optimized to obtain good equilibrium between the various misincorporations. 

4.6 ul of 5x reverse transcriptase buffer [250 mM Tris-HCI, pH 8.3 (at 37°C when diluted to 50 mM), 30 
mM MgCI„ 300 mM.KCI], 0.25 ul 10 mM dCTP, 0.5 ul 10 mM dGTP, 0.25 ill 10 mM TTP, 0^5 ul 0.1 M 

20 dithiothrettol, 1.25 ui 1.75 mg/ml bovine serum albumin and 0.5 ui reverse transcriptase (12-25 U/ul) are 
added to 15 ul of DNA in tube A-, containing about 0.5 pmol of template/elongated primer. The sample Is 
incubated at 37°C or at 42° C for 3 hours, and an additional 0.5 ul of reverse transcriptase is added after 1 
1/2 h. After 3 hours of reaction 1.25 ul of a mixture containing all the four nucleotides at 10 mM 
concentration and 0.2 ul of reverse transcriptase are added and incubated at 42°C for 30 min. The sample 

25 is then coded to 0°C. To elongate the newly synthesized strand to completion, the following components 
are added to each of the samples A-, C-, G-and T-: 1 ul 10 mM ATP, 5 ul of 5x reverse transcriptase 
buffer, 16 ul water, 2.5 ul T4 figase (6 U/ul), 2 ul p32 (protein p32 from phage T4, 2.5 mg/ml) and 0.5 ul of 
T4 DNA-polymerase (10 U/ul). The samples are incubated at 14°C overnight or for a few hours. 
Alternatively, the final elongation step can be performed with 0,7 ul Klenow polymerase (5U/ul, Boehringer) 

30 in the conditions described above, but leaving out T4 DNA polymerase and p32. 

Competent JM109 cells (see, Hanahan, O., Techniques for transformation of E coli. In: DNA cloning, 
volume I (Glover, D.M., ed.), IRL Press, Oxford and Washington, pp. 109-136 (1985)) are transformed with 
the above samples using 0.3 ng, 3 ng and 50 ng of DNA per transformation. Control transformations are 
done with the single stranded uracif-containing template, and with both single stranded and double stranded 

35 wild type M13mp19 template. The cells are grown on plates containing IPTG and X-gai (as in Messing, 
1983 (see above)). Typical transformation frequencies of the samples are around 10* transformants/ug, the 
uracii-template alone gives 10-60 transfbrmants/ug, the single stranded wild type M13mp19 about 10 s /Ug 
and the double stranded wild type M13mp19 typically 5*10' -l x 10//ug. 

Some of the mutations made to the a-fragment of the EL coli 0-galactosidase result in the formation of 

40 functionally defective 0-galactosidase and thus lead to a white or light blue phenotype of plaques on IPTG 
and X-gal containing indicator plates. Blue plaques represent either wild type a-fragment sequences, silent 
mutations where a nucleotide change does not lead to an amino acid change, or mutations where the 
resulting amino acid does not affect the activity of the enzyme towards synthetic substrate X-gai. In order to 
characterize the precise distribution, frequence, nature and equilibrium of the mutations obtained, several 

45 phage DNAs from the white, light blue and blue plaques from various mutagenesis experiments are isolated 
and sequenced using the same primer as for the limited elongation step. 

The mutations in the DNA isolated from the white and light blue plaques show which codon changes 
lead to functional changes In the enzyme. PhenotypicaJly silent mutations from the blue plaque DNAs are 
distributed essentially In a random fashion. Fig. 2 shows the target a-fragment sequence and the various 

so base substitutions found in the mutants. The primary target covered about 100 nucleotides from the primer. 
The region in parentheses Is the polylinker region of M13mp19. In several experiments the phenotyplc 
change was selected for, which explains the enrichment of certain point mutations. AH the twelve possible 
types of point mutations have been obtained, namely: 
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Altogether 108 different nucleotides in th target DNA have been changed, and because more than one 
type of mutation has been obtained at many positions, the total number of different mutations obtained is at 
least 176. In an optimized system this result is achieved in one experim nt, where the limited elongation, 
misincorporation and complete elongation steps can be performed in two days, and mutants obtained on 
the third day. 



Table 3 



Mutagenesis of template residues T, A # C and G 



Template 
residues 
mutagenized 



Misincorpora- 
tion conditions 



Yield of mutants 
{from sequencing 
data) 



Average No. 
of mutations 
per template 



Ratio of different 
mutations 



(A-reaction) 



dCTP 100 uM 
dGTP 100 uM 
TTP 100 uM 



dCTP 100 UM 
dGTP 0*2 uM 
TTP 100 



90 % 



total: 3.0 
blue: 2.4 

white 3.6 



T -> 



G 
C 
A 



white 
1 
33 
1 



42 % 



total: 1.4 
blue: 1.3 

whites 2.3 



blue white 
T -> G 5 4 
C 10 20 
A3 8 



(T-reaction) 



dATP 100 uM 
dCTP 100 uM 
dGTP 100 uM 



27 % 



total: 1.1 
blue: 1.1 
white: 1.6 



blue white 
A -> T 15 30 
G 9 32 
C 2 17 
(26 
deletions) 



(G-reaction) 



dATP 100 uM 
dCTP 100 uM 
TTP 100 uM 



dATP 20 >iM 
dCTP 150 uM 
TTP 50 uM 



59 % 



total: 1.7 
blue : 1.7 
white 1.6 



blue white 
C -> T 14 15 
GOO 
A 3 9 



ND 



total: ND 
blue : ND 
white : 1.8 



C -> T 
G 
A 



white 
12 
1 
21 



(C-reaction) 



dATP 100 urn 
dGTP 100 uM 
TTP 100 uM 



dATP 150 uM 
dGTP 100 uM 
TTP 0.2 uM 



74 % 



total: 2.6 
blue: 2.4 
white: 3.8 



G -> T 
G 
A 



0 
1 
58 



9 % 



total: ND 
blue : ND 
white: 1.2 



G -> T 
C 
A 



white 
5 
26 
8 
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The equilibrium between the different point mutations is altered by changing the concentrations of the 
"wrong" nucleotides in the misincorporation reaction as shown in Table 3. Table 3 characterizes the 
mutagenesis in resp ct of mutant yields, average number of base substitutions per template, and the ratio 
of different mutations when the three misincorporating nucleotides are used simultaneously at various 



11 



0 285 123 



concentrations. The G/T and T/G mismatches are form d very easily, and to avoid multiple mismatches the 
concentrations of the corresponding misincorporating nucleotides must be very low. A correlation is found 
between the total yield of mutant clon s and the number of base substitutions in an average mutant. The 
number of mutations per tern plat can be regulated by adjusting the nucleotide concentrations at the 

s misincorporation step as shown In Table 3. 

When the frequency of mutants is around 60%, each mutant template contains on the average 1.8 base 
substitutions. Since about one fourth of all possible base substitutions to the amino acid codons are silent, 
and thus preserve the original amino acid, the value above corresponds to only slightly more than one 
amino acid substitution per molecule on the average. This is close to optimal, considering that several 

70 amino acid changes will be conservative mutations which do not cause much difference in the properties of 
the polypeptide. 

The position and length of the target region to be mutagenized can be varied in a controlled manner. 
This is done by regulating the extent of limited elongation, simply by increasing (to mutagenize sequences 
further away from the primer) or decreasing (to make mutations closer to the primer) the concentration of 
75 the limiting nucleotide and then combining these aliquots. 



Example 2: Complete mutagenesis of the a-amylase from Bacillus stearothermophilus 

20 The a-amylase gene from B. stearothermophilus ATCC 12980, originally cloned in EMBL3 vector, is 
subcloned as a 1.9 kb fragment from a partial Sau3A digest, first to the BamHI site of pUC8 and then as a 
Sma l-Hindlli fragment to the phage vectors M13mp18 and M13mp19. Figure 3 shows the constructions 
obtained, M13mp18a and M13mp19a. The 1.9 kb long insert contains 234 nucleotides from the 5' 
noncoding promoter region of the a-amylase gene, the complete coding region and 13 nucleotides of the 3' 

26 noncoding region. Since the sequence of the insert is known, synthetic 20-mer oligonucleotides com- 
plementary to the target sequence at about 200 nucleotide intervals are used as primers. Both strands can 
be mutagenized separately. 

Uracil-containing single-stranded M13mp18a and M13mp19a are prepared by the method described in 
Example 1. The oligonucleotide primers mentioned above are annealed to the uracfl-containlng template as 

30 described in the previous example. The limited elongation, misincorporation and aiMhe-way-around elonga- 
tion are performed as described. Either uracil-containing M13mp18 or or M13mp19 a can be used as a 
template, or they can both be used as templates in separate tubes and together with their own primers. If 
both strands are mutagenized, the method is even more random and the balance between the different 
base substitutions is better. A suitable ung+ dut+ strain of E. coll, such as JM 109, is transfected with the 

35 mutagenized DNA and the ceils are plated on glucose minimal medlum/H-top amylopecttc azure (0.25%, 
Calbiochem) plates, with or without IPTG at 480 ug/plate. Expression of functional recombinant a-amylase 
t>y E. coli is detected as halos around the plaques. 

JM109 is grown overnight at 37°C In 2 x TY. 250 rpm, and diluted 1:100 in 2 x TY, This suspension is 
pipetted in 200 ul aliquots to the wells of sterile microliter plates (Sterilan). The ceils in each well are 

40 infected by a single plaque picked from the plates with sterile toothpicks. The microtiter plates are 
incubated in a humid chamber at 37° C, with shaking at 250 rpm, for 18 h. 

The a-amylase activity (Bemfeld, P., Amylases, a and 0 . 1, Methods Enzymol., 149-158, 1955) is 
determined by pipetting to empty microtiter plates (Nunc): 90 ul substrate solution, containing 1.1% 
Zulkowsky soluble starch in 0.2M sodium acetate buffer of the desired pH, 5 mM CaCI* 50 mM NaCI and 

45 10 ul of suitably diluted enzyme-containing sample from the overnight incubation on a microtiter well. After 
incubation at 65°C for 15 min, 150 ul dinitrosalicylate (DNS) is added and the samples are incubated for 10 
min at IOO°C (vapour of boiling water), cooled to room temperature and the absorbance at 540 nm is 
measured in a Multiskan MCC/340 spectrophotometer (Labsystems) against a zero-sample with 10 ul of 
water replacing the enzyme-containing sample. In a control sample the DNS reagent (150 ul) is added to 90 

so ul of substrate solution and the enzyme sample is added last. 

The wild type B. stearothermophilus a-amylase has an optimum pH of 5.5 and a temperature optimum 
of about 80° C. Screening for mutant enzymes with higher or lower activity than the wild type is done by 
determining the activities at pH 5.5, 65°C as described above. Screening for mutant enzymes with 
increased acitivity at pH 3.5 requires that the pH at the activity determination is 3.5 and it is achieved by 

65 adjusting the pH of the substrate solution. Mutants with increased thermostability are screened for by 
including an incubation step at a high temperature (such as 95°C for 1 hour) before the determination of a- 
amylase activities under optimal conditions. 

An alternative method, or a method to b used for preliminary scr ening for mutant amylases is a 
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replica plating test. E coii JM109 transfected with mutagenized M13mp18a or M13mp19a is grown on 
glucose minimal/H-top plates supplemented with 0.25% amylopectin azure (Calbiochem). After the forma- 
tion of plaques and production of a-amylase, a dry, sterile Whatman 1 paper is tightly pressed on the plat 
to trap the enzyme, and then the paper is transferred into good contact with a substrate plate and incubated 
5 at 65 °C for 2 or more hours. For screening of cr-amylases of increased activity at pH 3.5, the substrate plate 
contains 0.25% soluble starch (Zulkowsky) or insoluble starch (Maizena) on 1% agarose plate In pH 3.5 
buffer. 

Mutant clones are also characterized 'by sequencing to identify the mutation. Rgure 4 shows the 515 
amino acids long protein sequence of B. stearothermophiius a-amylase, and some of the amino acid 

10 replacements obtained from the mutagenesis are shown on top of the wild type sequence. These mutants 
have been identified from partial mutant libraries generated from particular primers. 

Several of the mutants obtained have diminished activity, and some of them are inactive. The method 
can thus also be used for mapping of the functional regions of proteins. For example, the mutants lie 354 -* 
Leu. Tyr 370 -* Ser. He 428 — Phe. Ser 435 — Phe. among several others, are Inactive towards starch. 

15 One particular mutant Tyr 285 -* (Ser. has an altered pH-behaviour as shown in Figure 5. The activities 
have been measured as described earlier, at 65°C In sodium acetate buffers of pH 3.5 - 6, and in PIPES 
buffers (piperazine - N, N'-bis [2-ethane sulfonic acid]) In the pH range 6-8. Compared to the activity of the 
wild type enzyme (100% activity), the mutant enzyme shows about 52% activity in the pH range 5.0 - 8.0, 
but at pH 4.5 it has only about 33% and at pH 4.0 about 1 1% of the activity of the wild type enzyme. This 

20 demonstrates that the activity versus pH curve can be altered by a single point mutation. 

The nucleic acid template for mutagenesis can also be RNA. In that case a complementary primer is 
hybridized to the RNA, the limited elongation and misincorporation steps are performed as described but 
using reverse transcriptase instead of Klenow enzyme in the elongation, and then double stranded cDNA is 
made using RNase H and DNA polymerase I as described by Teeri, T., Kumar, V., Lehtovaara, P. and 

25 Knowles, J., Construction of cDNA libraries by blunt end ligation: high frequency cloning of long cDNAs 
from filamentous fungi, 164 Anal. Biochem., 60-67 (1987). The double stranded cDNA is ligated to a suitable 
vector and expressed in a suitable host. Any mutation generated in the first strand of cDNA will be copied 
to the second strand, so that each of the progeny strands will contain the mutation. 

In place of the specific oligonucleotide primers used for limited elongation in this example, it is possible 

30 to use only one primer. If the sequence of the target is unknown, the primer can be hybridized to the 
bordering vector sequences. Limited elongation can be carried out using suitable concentrations of a 
particular limiting nucleotide to produce families of molecules of defined length, for example 200, 400, 600. 
800 etc. nucleotides. The elongation is monitored by electrophoresis of radioactively labeled samples 
together with size markers on sequencing gels or alkaline agarose gels (as described in Maniatis et al, 

35 1982), and the desired fractions are combined. A second limited elongation step is then carried out as 
described in Example 1 In four separate reactions, each with a different limiting nucleotide. 

Alternatively, the misincorporation step can be carried out in the presence of one nucleotide or 
thionucieotide only. The Klenow polymerase or any other polymerase can be used instead of reverse 
transcriptase if thlonucleotides are mislncorporated. The final elongation step, which gives molecules able to 

40 transform the host, can also be modified In any of the following ways: 

(1) Any polymerase, for instance the Klenow enzyme, can be used instead of T4 polymerase. If T4 
polymerase is not used, the p32 protein is unnecessary. 

(2) Additional primers can be hybridized to the vector sequences before the final elongation step to 
increase the amount of full length molecules. 

45 (3) After the misincorporation, the final elongation step can be replaced by hybridization of the uracil 

template to Its complementary strand, obtained by restriction enzyme cleavage of the vector replicative 
form. The remaining gaps are filled in with polymerase and sealed with ligase. 

The foregoing general discussion and experimental examples are intended to be illustrative of the 
present invention, and are not to be considered as limiting. Other variations within the spirit and scope of 

eo this Invention are possible and will present themselves to those skilled in the art. 



Claims 

55 1 . A method for Introducing random point mutations to any nucleic acid, characterized by 

(a) preparation of the target nucleic acid to be mutagenized in the form of a single stranded template; 
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(b) extension of an oligonuc! otid hybridized to this template to generate a population of DNA 
molecules, each copied from the template, and comprising molecules terminating at all possible nucleotide 
positions within a predetermined region; 

(c) mislncorporatlon of nucleotides opposit to each position of the template using polymerase and 
5 the variable 3* ends generated In (b) as primers; 

(d) completion of the DNA molecules generated in (c) to forms that can be amplified by molecular s 
cloning in a suitable host; 

(e) expression of the mutagenlzed DNA using a suitable host vector system allowing the screening 
for an altered gene or gene product 

10 2. The method of claim 1 , characterized in that the target nucleic acid codes for a polypeptide. 

3. The method of claim 1, characterized in that the target nucleic acid contains a region which 
functions in regulating gene expression. 

4. A method for preparation of a novel polypeptide including those not found in nature, characterized 

by 

75 (a) preparation of the target nucleic acid coding for the polypeptide to be mutagenlzed in the form of 

a single stranded template; 

(b) extension of an oligonucleotide hybridized to the template to generate a population of molecules, 

each copied from the template, and comprising molecules terminating at ail possible nucleotide positions 

within a predetermined region; 
20 (c) reincorporation of nucleotides opposite to each position of the template using polymerase and 

the variable 3' ends generated in (b) as primers; 

(d) completion of the molecules generated in (c) to forms that can be amplified by molecular cloning 
in a suitable host; 

(e) expression of the mutagenlzed nucleic acids, using a suitable host vector system allowing the 
25 screening for an altered gene product; 

(f) screening for a clone expressing a mutant polypeptide which functions better than the un- 
mutagenized polypeptide under the desired conditions. 

5. A method for altering the structure and function of nucleic acid regions which function in regulating 
gene expression, characterized by 

30 (a) preparation of the target nucleic acid in the form of a single stranded template, containing the 

regulatory region to be mutagenized and a region not to be mutagenlzed, coding for an easily quantitated 
gene product whose expression levels correlate to the structure of the regulatory region; 

(b) extension of an oligonucleotide hybridized to the template to generate a population of molecules, 
each copied from the template, and comprising of molecules terminating at all possible nucleotide positions 

35 within a predetermined region; 

(c) misincorporation of nucleotides opposite to each position of the template using polymerase and 
the variable 3* ends generated in (b) as primers; 

(d) completion of the molecules generated in (c) to forms that can be amplified by molecular cloning 
in a suitable host; 

40 (e) expression of the mutagenlzed nucleic acids, using a suitable host vector system allowing the 

screening for a clone where the mutagenized gene regulatory region functions better than the un- 
mutagenized regulatory region under the desired conditions. 
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the amount of carrier "12 ug u should be "2 jig". 

On page 21, in column T the amount of TTP should be "100 pM" 
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On page 28, line 8, "DNA molecules" should be amended to "molecules". 
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@ A method for complete mutagenesis of nucleic acids. 

@ The Invention relates to methods for introducing all types 
of base substitution mutations randomly to arty nucleic acid. 
The methods comprise 

(a) preparation of the target nucleic acid to be 
mutagenlzed in the form of a single stranded template; 

(b) extension of an oligonucleotide hybridized to this 
template to generate a population of molecules, each 
copied from the template, and comprising molecules 
terminating at all possible nucleotide positions within a 
predetermined region; 

(c) misincorporation of nucleotides opposite to each 
position of the template using polymerase and the variable 
3' ends generated In (b) as primers; 

(d) completion of the molecules to forms that can be 
amplified by molecular cloning in a suitable host; and 

(e) expression of the mutagenlzed DNA using a suitable 
host vector system allowing the screening for an altered 
gene or gene product 
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